7. Acknowledgment 8. References Cervical Cell Data 6. Concluding Remarks
نویسنده
چکیده
Non-parametric decision rules, such as the nearest neighbor (NN) rule, are attractive because no a priori knowledge is required concerning the underlying distributions of the data. Two traditional criticisms directed at the NN-rule concern the large amounts of storage and computation involved due to the apparent necessity to store all the sample (training) data. Thus there has been considerable interest in “editing” or “thinning” the training data in an attempt to store only a fraction of it. Previous editing algorithms suffered from the drawback that they delivered edited sets that were not decision-boundary consistent, i.e., the decision boundary determined by the edited set differed from that specified by the entire original training data. In this paper several geometric methods based on proximity graphs are proposed for editing the training data for use in the NN-rule. Most notably, one of the methods yields a decision-boundary consistent edited set and therefore a decision rule that preserves all the desirable convergence properties of the NN-rule that is based on the original entire training data. The methods are all derived from the Voronoi diagram of the sample data and make use of subgraphs of the Delaunay triangulation. The methods are compared empirically through experiments on synthetic data as well as real world data in the automatic detection of cervical cancer. Finally, algorithms for the efficient implementation of these techniques are discussed.
منابع مشابه
A Panel Data Study of Physicians’ Labor Supply: The Case of Norway
.................................................................................... 4 SAMMENDRAG ................................................................................. 5 INTRODUCTION................................................................................ 7 Background .................................................................................................. 8 Economet...
متن کاملErgodic Theory: Nonsingular Transformations
Glossary 1 1. Definition of the subject and its importance 2 2. Basic Results 2 3. Panorama of Examples 8 4. Mixing notions and multiple recurrence 10 5. Topological group Aut(X,μ) 13 6. Orbit theory 15 7. Smooth nonsingular transformations 21 8. Spectral theory for nonsingular systems 22 9. Entropy and other invariants 25 10. Nonsingular Joinings and Factors 27 11. Applications. Connections wi...
متن کاملBiosynthesis of Cholesterol and Other Sterols
4.5. C24-Alkylation Reduction Bifurcation in Phytosterol Synthesis 6433 5. Sterol Enzyme Action 6434 5.1. C24 Methylation 6434 5.2. C24-Reduction 6438 5.3. Removal of Nuclear Methyl Groups at C4 6439 5.4. Removal of Nuclear Methyl Group at C14 6441 5.5. Shift of Δ to Δ-Position 6442 5.6. 9β,19-Cyclopropane Ring Opening 6444 5.7. C22 Desaturation 6444 6. Concluding Remarks 6445 Author Informatio...
متن کاملBoundedness and K for Log Surfaces
0. Introduction 1 1. Standard definitions 3 2. Examples 4 3. Some methods for proving boundedness 8 4. Additional definitions and easy technical results 9 5. The diagram method 10 6. Boundedness for surfaces with nef −(K+B) 14 7. Boundedness for surfaces with big and nef K +B 18 8. Descending Chain Condition 23 9. Boundedness for the constant (K +B)2 27 10. On log MMP for surfaces 28 11. Conclu...
متن کاملDevelopmental exposure to nicotine at different neonatal ages Analysis of protein tau in the cerebral cortex
3 Introduction 4 Exposure to toxic agents present in our environment 4 Brain development 4 Cholinergic system 5 Nicotine receptors 6 Muscarinic receptors 7 The tau protein 7 Nicotine 8 Aims 9 Materials and Method 10 Treatment 10 Protein analysis 10 Statistical analysis 11 Results and Discussion 12 Concluding remarks 19 References 20
متن کامل